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Abstract 

The extremal tail probabilities of moving sums in a marked Poisson random field is 
examined here. These sums are computed by adding up the weighted occurrences of events 
lying within a scanning set of fixed shape and size. Change of measure and analysis of 
local random fields are used to provide tail probabilities. The asymptotic constants are 
initially expressed in a form that seems hard to evaluate and do not seem to provide any 
additional information on the properties of the constants. A more sophisticated approach is 
then undertaken giving rise to an expression that is not only neater but also able to provide 
computable bounds. The technique used to obtain this constant can also be modified to work 
on continuous processes. 
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1. Introduction. The maxima of moving averages in Gaussian random fields in di- 
mension d > 1 was studied in Siegmund and Worsley (1995) and Shane, Sigal, Siegmund 
and Worsley (2003), with applications in imaging and signal detection. Two key techniques 
used are (i) the Karhunen-Loeve expansion with the volume of tube formula and (ii) the 
Euler characteristic; see Adler (2000) for an overview of the research area and also Taylor, 
Takemura and Adler (2005) and Taylor (2006) for more recent developments. 

The maximum of moving sums in Poisson random fields, more commonly known as 
scan statistics in the statistical literature, also have widespread applications in molecular 
biology, epidemiology, geostatistics and image analysis, cf. Cressie (1993), Anderson and 
Titterington (1997), Glaz, Naus and Wallenstein (2001) and Chan and Zhang (2007), but the 
tail probability approximations are in comparison not as well developed for d > 1. While the 
tail probabilities of these sums have been studied in Naus (1965), Loader (1991) and Aim 
(1997), restrictions to rectangular scanning sets have been imposed for analytical convenience. 

We set out here to study the tail probabilities of the maxima of moving sums with 
minimal restrictions on the choice of scanning sets. A theory parallel to the study of tail 
probabilities in Gaussian or Gaussian-like random fields in the classical framework of Pickands 
(1969), Bickel and Rosenblatt (1973), Quails and Watanabe (1973), Piterbarg (1996) and 
Chan and Lai (2006) is developed here. We also consider a more general marked Poisson 
random field, which is motivated by recent developments in molecular biology, see for example 
Chan and Zhang (2007). This generalization entails careful consideration of overshoots in 
special cases of scanning sets that is not required in Poisson random fields. Berman (1982) 
and Albin (1990) have also studied tail probabilities of stationary processes but their limiting 
results are of a different type and do not apply here. 

The first expression of the tail probability is stated in Theorem 1 in Section 3. Lemma 
1 is the basic building block of Theorem 1, providing the extremal tail probability over a 
local domain by using a change of measure approach. The expression of this tail probability 
requires a description of an induced local random field around the boundary of the scanning 
set and this is provided in Section 2. The technical details of how these building blocks can 
be combined together to provide the tail probability of the maxima of the sums over the 
whole domain, via an adaption of the Pickands-Qualls- Watanabe technique, is given in the 
Appendix. In Section 4, we provide an alternative expression of the asymptotic constants in 
Theorem 1 via a more refined technique and obtain bounds of these constants. In Section 
5, we adapt this technique on continuous valued random fields and show that it provide 
constants that looks like a differential form of the constants obtained via the beautiful Poisson 
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clumping heuristic shown in Aldous (1989). Some bounds obtained from the new expression 
are surprisingly accurate. 

2. Definitions, notations and a local Poisson random field. Let D and B 

be Jordan-measurable (bounded) subsets of R rf . For vectors t = (ii,...,td) and u = 
(u±, . . . ,Ud), we shall use the notation t y u to denote tj > Uj for all j. We shall also 
let = (0, ... , 0) and 1 = (1, . . . , 1). Let Ofc(-), k = doik = d — 1 denote the A;-dimensional 
volume of a /c-dimensional manifold in R d . For any A C R d , i £ R and c G R d , we shall let 
j^A denote the number of elements in A and c + bA = {c + ba : a G A}. We shall also use 
|| • || to denote L2-norm and || • ||oo to denote Loo-norm. 

Assume that the boundary dB can be expressed as a finite union of smooth (d — 1)- 
dimensional submanifolds possibly with boundary (see Spivak (1965) pi 13 for the definition). 
For example, if B = {t : ||t || oo < l}i a cube of length 2, then dB is a union of 2d faces, each 
a smooth (d — l)-dimensional submanifold with boundary. 

Let X = {(ti,Xi) : i > 1} be a marked point process on R d+1 , characterized by F, a 
distribution function of the marks Xi and A > 0, the rate of events occurring. Hence for any 
set A, Borel subset of R d+1 , #{i : (tj,Xj) G A} follows a Poisson distribution with mean 
A J A dt x dF{x). Moreover, for any two disjoint Borel sets A and C, #{i : (tj,Aj) G A} and 

: (ti,Xi) G C} are independent random variables. From the above description, we may 
assume without loss of generality that X\, X2, ■ ■ ■ are independent and identically distributed 
(i.i.d.) random variables having distribution F and independent of {tj : i > 1}; a Poisson 
process with rate A. 

Let n = EXi and M(9) = Ee 6Xl . Assume that P{X X > 0} > and 9 := {6 : 
M{9) < 00} is an open neighborhood of 0. For any set A, Borel subset of R d , define the sum 
S(A) = Ylti^A-^i- We analyze here the tail probability 

(2.1) Px :=P x {supS(v + B)>\c} 

vGD 

as A — > 00, for given c > max{0, fiad(B)}. 

Through an appropriate transformation, we can also look at the limiting probability 
of p\ as one involving fixed Poisson rate Ao > and increasingly large scanning sets. Let 
9\ = (VAo) 1 ^ De the scaling constants. Then p\ = P\ {sup vegxD S(y + g\B) > Ac}. For 
notational simplicity, the analysis here looks at p\ in terms of (2.1) but the presence of such 
transformations has important practical implications. 

We will now proceed with the description of a limiting local random field Y = {Y(u) : 
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u G R}, that is derived from both the distribution F and the geometry of the boundary dB. 
For a given c > max{0, fiad(B)}, let 9 C > and distribution F c satisfy 

(2.2) M'(0 C ) = c/a d {B) and F c (dx) = e 9cX F(dx)/M(9 c ), 

where ' here denotes first derivative. 

Let = {(vj 1 -*, z\ 1 ^) : i > 1} be a marked Poisson process such that {v^ : i > 1} is 
a Poisson process with rate 1 on the domain dB x [0, oo) and Z^\z^\ . . . are i.i.d. with 
distribution F. Let TP^ 1 = {(vj 2 -*, Z^) : i > 1} be a marked Poisson process independent of 
ZW, such that {v^ : z > 1} is a Poisson process with rate M(9 C ) on the domain dBx (— oo, 0) 
and z[ 2 \ Z^ 2 \ ■ ■ ■ are i.i.d. with distribution F c . Let n t be the unit normal vector of t € dB 
away from B and let • denote dot product. For u £ R d , let 

yO')(u) = ^ 4 j) for j = 1,2, where 

(2.3) 



^ } = (J t x [0, n t • u) and A {2) = [j t x [n t • u, 0). 

te9-B:n t -u>0 tG9B:n t u<0 

We define 

(2.4) Y (u) = y (u) - Y (2) (u) for all u € R d . 



3. First expression of asymptotic tail probability. A key idea here is a change 
of measure argument that allows us to obtain, in Lemma 1, the tail probability of the max- 
ima over a local domain. To obtain the global probabilities in Theorem 1 from these local 
probabilities, we adapt the Pickands-Quall-Watanabe technique from the Gaussian random 
field literature. Hence the characterization of the constant K in Theorem 1 bears a striking 
resemblence to constants seen in the earlier papers on Gaussian random fields though the 
distribution of Y(u) here is compound Poisson rather than Gaussian. 

Let Q\ be a probability measure under which X is a nonhomogeneous marked Poisson 
process with rate \M(9 C ) and mark distribution F c inside B, and rate A and mark distribution 
F outside B. Hence under Q\, for any set A, Borel subset of R d+1 , #{i : (tj, Xj) G A} follows 
a Poisson distribution with mean \M{0 C ) Ja^bxR.) ^ x dF c {x)+\ fAn(B c xH) ^ x dF(x) while 
#{i : (ti,Xi) e A} and #{i : (tj,Xj) G C} are independent random variables for disjoint 
sets A and C. By (2.2), 

(3.1) ^Er(X) = e W {9 c S(B) - \a d {B)[M(9 c ) - 1]}. 
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Let E t ,m,\ = {suPt^v^t+mA- 1 ! S(y + B) > Ac}. In the proof of Lemma 1 below, we analyze 
the event £b,m,A under Q\ before applying the identity P\(£b,m,A) = Eq x [{dP\/ dQ\)lE m X }- 
We shall now define some terms required for the statement of Lemma 1. 

For given c > max{0, ^ad(B)}, let 

(3.2) I(=I c ) = e c c-a d (B)[M(e c )-l]. 

It follows from Theorem 1 below that / = — lim^oo A -1 logp\ and hence I is the large 
deviation rate of the tail probability. If there exists 77 > such that F is concentrated on 
±77, ±277, . . ., then we say that F is arithmetic. The largest 77 with this property will be called 
the span of F, cf. Feller (1971) Section 5.2. If such 7/ does not exists, then we say that F is 
nonarithmetic. Let |_-J denote the greatest integer function and " the second derivative of a 
function. 

Lemma 1. Let c > max{0, fxa^B)}. Define x\ = 6 c (Xc — r][Xc/r]\) if F is arithmetic 
with span 77 and x\ = if F is nonarithmetic. Then for all t G D, 

(3.3) Px(E t , m ,x) = Px(E , m ,x) ~ [2K\o- d (B)M"(6 c )]- l l 2 e- XI+x >>K m as A - 00, 

where 
(3.4) 



K m = < 



7? 



+ X^^e??z+ e -P{ su Po-<u-<mi ^( u ) — ^} ^/F ^ s arithmetic with span 77, 



-iu-imi ^( u ) ^ y}^?/ is nonarithmetic. 



PROOF. By stationarity, P\(E tj m : \) = P\{Eo :m: \). Let us first consider the case F 
arithmetic with span 1. Then 

P\(E , m ,x) = Px{ sup S(v + B) > Ac} 

(3.5) 00 

=P X {S(B) > LAcj} + Y,Px{S(B) = [Xc\ - £, sup [5(v + 5) - 5(B)] > £}. 

Let B a = floral ( v + B). Since S(B \ B mX -i) and sup^^-i^v + B) - S(B)} are 
functions of the marked Poisson process occurring outside B mX -i and hence independent of 
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S(B mX -i) under Q x , it follows from (3.1) that 

P X {S(B) = [Xc\ - e, sup [S(v + B) - S{B)} > £} 

= e- XI+x ^ e Q x {S(B) = [Xc\ - £, sup [S(v + B) - S{B)\ > £} 

(3.6) 

= e-^+^+^^Q A {5( J B mA - 1 ) = [Xc\ -l-k} 

xQ x {S(B\B mX - 1 )=k, sup [S(v + B)-S(B)}>£}. 

It follows from the local central limit theorem that for each £ G Z, 

QxiSiB^) = [Xc\ - l-k) <[l + o(l)]Q x {S(B) = [Xc\ - £} 

~[2KXa d {B)M" {6 C )Y 1/2 as A -> oo, 

uniformly over k > 0, with < replaced by = if we look at (3.7) with k fixed. Hence by (3.6) 
and (3.7), 

P X {S(B) = [Xc\ - £, sup [5(v + B) - S{B)\ > £} 

- [2^Aa ( i( J B)M // (0 c )]- 1 / 2 e- A/+ ^+^Q A { sup [S(v + B) - S(B)] > £}. 

O^v-imA- 1 ! 

By (3.8) and the weak convergence of {S'(A _1 u + B) — S(B) : -< u -< ml} under Q x to 
{Y(u) : -< u -< ml} as A -> oo; see (2.3) and (2.4), 

oo 

P A {S(5) = LAcJ - £, sup [5(v + B) - S(B)} > £} 

(3-9) 

~ e~ XI+x * [2irXa d (B)M"(e c )]~ 1 / 2 SU P Y ( u ) > 

By a similar application of (3.1) and (3.7), 
(3.10) Px{S(B) > [Xc\} ~ [27rA < 7 d (5)M"(0 c )]- 1 /2 e -A^ ^ e M 

^=-oo 

Substitution of (3.9) and (3.10) into (3.5) then proves Lemma 1 when F is arithmetic with 
span 1. For F arithmetic with arbitrary span 77, we prove Lemma 1 by replacing the sums 
in (3.5), (3.6), (3.9) and (3.10) by E^z+> T,k>o,ke v z or ^2e<o,eer,z- For nonarithmetic F, 
the sums are replaced by corresponding integrals. The detailed arguments are similar to the 
proof above and shall be omitted. □ 



i CMu^ml 
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Theorem 1. Let c > max{0, fiad(B)} and define x\ as in Lemma 1. Then 

(3.11) K := lim m~ d K m is a well-defined positive and finite constant. 

Moreover, 
(3.12) 

px = P A {sup 5(v + B) > Ac} ~ [2^a d {B)M"{9 c )]- 1 / 2 e- XI+x ^\ d - 1 / 2 a d {D)K as A ->■ oo. 

Remarks. By Jordan measurability of D, 
(3.13) 

d d 

#|k G (aZ) d : JJ[kj, % + a) C £>} ~ #{k G (aZ) d : \\[kj, kj + a) n D / 0} as a -> 0. 

J=l 3=1 

The relation (3.12) still holds if D is replaced by domains D\ that depends on A, provided 

(3.13) holds with D replaced by D\ and a replaced by mA^ 1 , with limit A — > oo for all large 
m, and 

(3.14) lim A- 1 log[(7 d (£) A )]=0. 
Without condition (3.14), the correct relation is 

P x { sup 5(v + B) > Ac} ~ 1 - exp{-[27r ( r d (S)M // (0 c )]- 1 /2 e -A/+^ ^-1/2^(^)^1 
veD A 

We will now discuss an interesting case of Theorem 1. In Example 1, we consider 
rectangular scanning sets on a marked Poisson random field. We show here that an overshoot 
constant derived from F plays an important role in the tail approximations. When F is 
degenerate at 1, that is for Poisson random fields rather than marked Poisson random fields, 
the overshoot constant is equal to 1 and disappears from the resulting formula. 

Example 1. Let B = Ylk = i[0,b k ] with b k > for all k. Since dB is a union of 
2d faces, with a pair of them orthogonal to each co-ordinate vector, by (2.3) and (2.4), 

= Tf k =i[ Y k ] '(«*) " ifc 2) («*)]. where ■ ■ ■ . Y d l \ Y ?\- ■ ■ . y i 2) are independent one- 
dimensional compound point processes. The process 1 < k < d, is constructed from 
a marked Poisson process having Poisson rate Yie^k ^ * ne surface area of the face of B 

(2) 

orthogonal to the kth. co-ordinate vector, and mark distribution F. The process Y" fe v , 1 < 
k < d, is constructed from a marked Poisson process with Poisson rate M{9 C ) fle^k ^ an< ^ 
mark distribution F c . If X is a random variable with distribution F c , we shall let F c denote 
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the distribution of —X. We define F in a similar manner. Consider first F nonarithmetic 
and let Y k = Y^-Y^. Then by (3.4) and P{sup ^ ml [ZLi Y k {u k )] > y} = 1 for y < 0, 
(3.15) 

K m = f°° e 9 *p{ sup \Y d Y k {u k )]>y\dy 

/OO f'tf ^ ^ 

(/ e e * u du)p\ sup [VnK)| Gdy) =^ 1 J BexpkV sup Y k (u k ) 
-oo i/ — oo 7 l 0^ml L ^ J > L ^0<tt fc <m J 

^ ^ />oo 

^n^exp^ sup y fc (« fc )] = et 1 H / SU P Yk(u k )>y}dy. 

k —\ 0<Uk<m k —\ ~°° 0<Ufc<m 

Since Y fc (1) and Y fc (2) are independent compound Poisson processes, it follows that Y k (u k ) = 
J2f=[ Uk ^ U k j, where N k is a Poisson process with rate dl^fc M[l + -^(^c)] and L^i, Lfo, • • • 
are i.i.d. random variables independent of N k such that 

(3.16) P{U kl G du} = [M(0 c )F c (du) + F(d«)]/[1 + M(0 C )]. 

Let P* be a probability measure under which the distribution of N k is unchanged and 
U k i, U k 2, ■ ■ ■ are i.i.d. random variables independent of N k satisfying 

(3.17) P*{U kl G du} = \F(du) + M(9 c )F c {du)]/[l + M(9 C )). 
By (2.2), (3.16) and (3.17), 

(3.18) (dP*/dP)(U kl ) = e 9 ° u * 1 . 

Suppressing the notation k, let = U\ + ■ • • + E/^ and r y = inf{£ > 1 : Ri > y}. Define the 
overshoot constant 

(3.19) v c = lim E^e~ 6c( - RT y~ y \ 

where E* denotes expectation with respect to P*. See Siegmund (1985) Chapter 8 for the 

existence and computation of v c . By (3.17)-(3.19), 

(3.20) 

/OO /'OO 
e**P{ sup Y k (u k ) > y}dy = 9' 1 + / ^^^^^{supo^^y.K)^}]^ 
-oo 0<u k <m JO ~ k ~ 

~ v c E*[ sup Ifc(tifc)] ~ v c m[c(J d l (B) - fi] fy, 

noting that by (2.2), under F c , E C X X = M'{6 C )/M{6 C ) = c/[a d (B)M{0 c )} and by definition, 
under F, EX 1 = /x. Substituting (3.20) into (3.15) and (3.11), (3.12) then gives us 

(3.21) Px ~ [27ra (i ( J B)M"(0 c )]- 1 /2 e -A/+^ A rf-i/2^ (jD){z , c[c(Jrf i (jB) _ TJ 6fe ^\ 
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c 


2 


3 


4 


5 


6 


7 


8 


9 


10 


oo 


(I) F concentrated at 1 


.160 


.497 


.818 


1.08 


1.29 


1.47 


1.62 


1.75 


1.85 


3.12 


Lower bound of (I) 


.0235 


.0795 


.137 


.188 


.232 


.268 


.299 


.325 


.348 


.636 


(II) F ~ iV(0,l) 


.153 


.324 


.495 


.654 


.797 


.929 


1.04 


1.15 


1.25 


3.12 



Table 1: Entries of Kj (1 + c) 2 for the second row and K9 c /[l + M(6 c )} 2 for the fourth row for 
the kernel B = {t : ||t|| < 1} with d = 2. These numbers have an approximate 1% numerical 
error. The third row is obtained from the inequality K > 2(c — l) 3 /[7r(l + c)], see Example 



where Xc = &c when F is nonarithmetic. Using similar arguments, the relation (3.21) can also 
be shown to hold for F arithmetic with span rj, by defining v c in (3.19) with limit y £ r/Z, 
y — ► oo and Xc = f? -1 (l — e~ v0c ). 

4. An alternative approach. The evaluation of the constant K in Example 1 for 
rectangular kernels follows along the lines of Hogan and Siegmund (1986). However, when 
B is not rectangular, the expression of K via (3.4) and (3.11) does not seem to be helpful 
except for indicating how the proofs of Lemma 1 and Theorem 1 is expected to proceed. 
This is unsatisfying since kernels of other shapes are often used in practice. For example, in 
epidemiology and geostatistical applications, the circular kernel B = {{t\,t2) ■ t\ + 1| < 1} 
provides a more desirable co-ordinate free space symmetry. For space-time problems, the 
corresponding kernel is the cylindrical scanning set B = {(t±, £2^3) : t\ + 1| < L 1^1 < !}• 

To search for an alternative formulation of K, it is best to start with the special case F 
concentrated at 1, for which the identity 

(4.1) ^^'({ueR^fu)^})] 

holds. This identity looks surprising initially because the right hand side involves only the 
occupation measure of the conditional process Y at and does not seem to be related to the 
maxima of Y. Is (4.1) true for general F ? Before answering this question, we first show how 
(4.1) can be utilized to provide a lower bound for K. 

Example 2. Let F be concentrated at 1. By (4.1), K > {^[^({u : Y(u) = 0})]} -1 . 
Let B d = {t : ||t|| < 1} with t G K d and let C d = <J d -i{dB d ) / aj^B^) = [d^ d l 2 /T{d/2 + 
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l)]/[7r( d - 1 )/ 2 /r((d + l)/2)] d . Then 

POO 00 

E[a d ({u : Y (u) = 0})] = C d / r^ 1 V e^ 1 ^ [c k r 2k / \kl) 2 } dr 

JO h n 

(4-2) 

= C d J2 c k T(2k + d)/[(fc!) 2 (l + c) 2k+d ] = [C d /{1 + cflft'H^/V + c)), 

fc=0 

where /^(x) = a// \/l — 4x 2 and is the £th derivative of a function g. Similar computations 
can also be carried out for kernels of other shapes. 

Theorem 2. Let c > max{0, /j,a d (B)}. Then 

(4.3) K = Xc l E{[{\ - exp(# c sup{u G R d : Y(u) < 0})}a~ x ({u G R d : Y(u) = 0})} 

where Xc = — e~ v0c ) if F is arithmetic with span rj and Xc = ®c if F is nonarithmetic. 

Let {(t(i),yi) : % > 1} be a unit rate Poisson process defined on <9f? x [0, 00) and define 
the random set 

!J = {u£ R d : n t (j) • u < or j/j(n t (j) • u) > ||u|| 2 for all i}. 

Then [1 + M(9)] d a d ({u : Y(u) = 0}) =4- ^(Q) as c -»■ 00. Hence we obtain the following. 

Corollary 1. If F is nondegenerate, then K/{xc 1 [^ + M(6 c )] d } is bounded above by 
-Efcr^ 1 ^)] for all c and tends towards Ela^ 1 ^)] as c — > 00. If F is degenerate at n > 0, 
then K/[l + M(# c )] rf is bounded above by nEla^ 1 (tt)] for all c and tends toward nEla^ 1 (fi)] 
as c — > 00 . 

The case F degenerate at 77 stands out because sup{Y(u) : Y(u) < 0} = —n with 
probability 1. Note also that f2 depends only on the kernel B and not F. 

PROOF of Theorem 2. Let us first consider F arithmetic with span 1. To simplify 
notations, select A such that x\ = (i.e. A G Z/c). We shall also abuse notation here and 
write S(v) in place of 5(v + B). By the change of measure argument in the proof of Lemma 
1 and the probability bounds obtained in Lemmas A.l and A. 2, for any integers < k < £, 
t in the interior of D and dw G (0, 00), 
(4.4) 

P x \ sup 5(v) = Ac, S(t) = Xc-k, a d ({v : S(y) = Ac - k}) G X~ d dw, 

sup{,S(v) : 5(v) < 5(t)} = Ac - ^} ~ [27rA^(i?)M"(^)]- 1 / 2 e - A/+ ^V (i ( J D) 

x P{ sup y(u) = k, a d {{u : y(u) = 0}) G dw, sup {F(u) : Y(u) < 0} = k - i\. 
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Multiplying (4.4) by X d (e 9ck — e 9ct )/w and integrating over t £ D and w > 0, we obtain 

(e _flc * - e- 0ct )P x \ sup S(v) = Ac, <r d ({v : 5(v) = Ac - fc}) > 0, 
veD 

a d ({v : 5(v) = Ac - £}) > 0, a d ({v : Ac - I < S(v) < Ac - fc}) = o) 

(4.5) J 

~[2vra (i ( J B)M /, (0 c )]- 1 / 2 e- A/ A d - 1 / 2 ( 7 d ( J D)(l - e <M*-<)) 

X E [ a d 1 ({ U : = 0})l{sup ueRd y(u)=fc,sup ueRd {y(u) : Y(u)<0}=A ; -< 

Theorem 2 then follows by adding up (4.5) over the integers < k < i and comparing against 

(4.6) Pj sup 5(v + B) = Ac) ~ [27r < 7 d (B)M // (0 c )]- 1 /2(i _ e" 9 ^ 1 \ d ~ l l 2 a d (D)K, 

a straightforward modification of Theorem 1. For F arithmetic with arbitrary span rj > or 
F nonarithmetic, the arguments are similar. □ 

5. A relook at the Poisson clumping heuristic. In this section, we consider a 
continuous valued random process AT(t), t G D. To make the discussion concrete, we pick 
the isotropic mean zero Gaussian random field X(t), t G R d satisfying 

(5.1) F[X(t)AT(t + s)] ~ 1 - a||s|| a as ||s|| 

for some < a < 2 and < a < oo. It was shown in Bickel and Rosenblatt (1973) and 
Quails and Watanabe (1973) that 

(5.2) P{sup X(t) > c} ~ {2Try 1 / 2 c 2d/a - 1 e' c2/2 a d/a K as c ^ oo. 

teD 

The approach is via a conditioning on X(t) = c— y/c for y > which leads to the expression 

____ POO 

(5.3) K = lim / e y P{ sup F(u) > y} dy, 
where Y is a Gaussian process satisfying 

(5.4) EY(u) = -||u|| Q , Cov(y(u),y(v)) = ||u|| a + ||v|| a - ||u - v|| a . 

Aldous (1989) using the Poisson clumping heuristic, conditioned instead on X(u) > c + y/c 
for y > and it follows from this approach that 

(5.5) K = E[a d - 1 ({Y(u):Y(u)>-Z})], 

where Z is an independent exponential random variable with mean 1. In Theorem 3, we 
apply the technique used to prove Theorem 2 to provide a differential form of (5.5). 
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Theorem 3. K = lim^ Jq E[a^({u : -b < Y(u) < £ - b})]db. 

Example 3. We shall provide lower bounds of K using the harmonic mean inequality 
as in J20 of Aldous (1989). Let B = {t : ||t|| < 1}. By Theorem 3, K > lim^ J^{E[a d ({u : 
-b < Y(u) < £ - b})]}- 1 db and hence 

~ f'°° 

K- 1 < V d -l{dB) / r rf-l( 4vrr «)-V2 e -r 2 V(4r") dr 

Jo 

This leads to the inequality 

(5.6) K > cTVi-W-^ar^ + l) /v(± - ±). 

In the case a = 2, Y has a simple characterization from which K = ■K~ d l 2 can be computed. 
For d = 2, the right hand side of (5.6) is tt^ 1 (=K) and for d = 3, it is 1/(4-^7?). 

Proof of Theorem 3. Let £ > 0, < v < £, X sup = sup teD X(t) and Y sup = 
sup ueR d Y(u). For any integer fe > 0, t in the interior of D and du> G (0, oo), 

^{^sup > c, X sup - [(A; + 1)£ - u]/c < X(t) < X sup - (fee - v)/c, 

a d ({u : X sup - [(k + 1)£ - «]/c < X(u) < X sup - (A£ - u)/c}) G (c 2 a)- d / Q <M 

~(27r)^/ 2 C -V c2 / 2 {( /" e -"d J /)p{fce-«<n up <(fc + 
LV J-(fcC-t>) 7 

a d ({u : y sup - [(fe + 1)£ - u] < Y(u) < y sup - (fee - «)}) G <M 

/-(fc£-i>) 
e-f P{y < y sup < (fe + 1)£ - v, 

a d ({u : y sup - [(fe + 1)? - c] < y(u) < Ysup - (A:? - c)}) G dw}dj/}. 

Multiply (5.7) by (e~ k ^ - e~( k+1 ^) /[(c 2 a)~ d l a w\, then integrating over t G D and to > and 
add over fe > 0. Then 

oo 

^ (e -fcS_ e -(fc+D? )P{Xsup > c} 

A;=0 

oo 

~(27r)-V^/«-i a -/« e ^/2{( e -_ e ^)^^[i w ^ op<(Jfc+1)c _ 1 , } 

fc=0 

x ^({u : y sup - [(fe + i)e -v}< y(u) < y up - (fee - «)})] + o € (i)}, 

where o 5 (l) -► as e 0. By (5.2), 
(5.8) 

£ = Hm / p^-^ju : ysup-(eL(^sup+^)/£J -v-o < k(u) < nup-(eL(nu P +«)Aj-«)})]d« 



12 



and Theorem 3 is shown. □ 
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Appendix: Proof of Theorem 1 

Let J = [0,l] d , = {k e (aZ) d : k + aJ C D} and C a = {k G (aZ) d : k + aJnD^ 0}. 
Then {k+aJ : k E C_ a } and {k+aJ : k 6 C a } are lower and upper coverings of D respectively 
by cubes of length a. We shall show via Lemmas A.l and A. 2 that 



(A.l) limlimsup[ V P x { M (£ u , m , A n E^ x )\ / {X^e^ 1 ) 



= 0. 



Let /(A) = [27r ( j d ( J B)M // (0 c )]- 1 /2A^- 1 /2 e -A/+x A Then by Lemma 1? 

(A.2) Px{E u ^ x ) ~ K m f(X)/\ d as A ^ oo. 

Given e > 0, let m e be large enough such that for all m > m e , the expression in the 
square brackets on the left-hand side of (A.l) does not exceed e for all large A. Then by 
(A.2), for all m > m £ , 



UmM[\- d (#C m/x )K m - 6A d - 1 / 2 e- A/ //(A)] < liminf \p x /f(X)} 

(A.3) A ^°° A ^°° 

< limsup[p A //(A)] < hmsup[A- d (#C , m/A )^ m ]. 

A^oo A^oo 



Since D is Jordan-measurable, 

(A.4) #C a ~ #C a ~ a- d a d (L>) as a - 0. 

Noting that liminf A _ >0O [p A //(A)] and \imswp x ^ 00 \p x / f(X)] are fixed real numbers and x x is 
bounded, it follows from (A.3) and (A.4) that m~ d K m is Cauchy. Hence K = linim^oo mT d K m 
exists and (3.12) follows from (A.3). 

We will now state and prove Lemmas A.l and A.2 before providing the complete proofs 
of both (A.l) and Theorem 1. To avoid repetitive arguments, we will state and prove all 
subsequent results assuming F is arithmetic with span 1. The modifications required to 
extend these results to arbitrary F are relatively straightforward and will not be discussed. 



Lemma A.l. 



(A.5) lim limsup \p x {S{B) < [Xc\ - r, sup S(v + B) > [Xc\}/(X- 1/2 e' XI ) 



r— +oo 



A^oo 



(Mv^A- 1 ! 



= 0. 
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Proof. By (3.8) with m = 1 and the weak convergence of S(X 1 u + B) - S(B) to Y(u) 
under Q\, 

P\{S(B) < [Xc\ - r, sup 5(v + B) > [Xc\} 

(A.6) oo 

~ [2nXo d {B)M"{e c )Y l l 2 e~ XI+x * V e^P{ sup Y(u) > as A oo. 

Let x + = max{x,0} and x~ = max{ — x,0}. It follows from (2.3) and (2.4) that 

(A.7) supY(u)<Z*:= £ \zf Y + £ I^l". 

o-<u^i .fr" ,r-^ 

where = dB x [0, d 1 / 2 ) and ^( 2 ) = 95 x [-cF 1 / 2 , 0). We can also express Z* = J2f=i Vj, 
where N is a Poisson random variable with mean k := 1 — F(0) + M(# c )F c (0) and V\, V2, . . . 
are i.i.d. random variables independent of N with g := Ee dVl < 00 for some 6 > 9 C . Since 
Ee §z* = e «(ff-i) ) it follows from Chebyshev's inequality that > £} < e K (9-i)-^. Hence 

by (A.7), 

00 

(A.8) V e ej P x { sup Y(u)>£} = 0(e K( - 9 ~ 1) - (§ - dc)r ). 

e=r +i 

Lemma 2 follows from (A.6), (A.8) and because x\ is bounded. □ 

Lemma A. 2. Let r > and L > be given. Then 
(A.9) 



lim limsup 

fc^oo x^oo 



£ > LAcj -r, S(v+i?) > LAcJ -r}/(A- 1 / 2 e " A/ ) 

ve(A- 1 Z) d :fcA- 1 <||v|| 00 <L 



= 0. 



PROOF. Let Qa(= Qa,v) be the probability measure under which the marked Poisson 
process X has Poisson rate XM(6 C ) on B 1 := B n (v + B), rate XM(6 c /2) on 73 2 := (B \ (v + 
.B)) U ((v + B)\B) and rate A elsewhere on R d . Moreover we require that under Q\, the 
marks have distribution F c on J5i, distribution F satisfying F(dx) = e ecX / 2 F(dx)/M(6 c /2) 
on B2 and F elsewhere on H d . Then 

(A.10) — (X) = ( ]J e^je"^^ ^^'" 1 ^ g^x^g-Aa^MM^)-^ 

Since M is a convex function and M(0) = 1, C := [M(0 C ) - 1] - 2[M(6» c /2) - 1] > 0. We can 
thus express (A. 10) as 

d ® X (X) - Je c /2)[S(B)+S( V +B)]-X[a d (B)+a d ( V +B)][M(d c )-l]/2+XCMB2)/2 
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and it follows from (3.2) and an analogue of (3.7) that 
(A.ll) 

P\{S(B) > [Xc\ - r,S{v + B) > [Xc\ - r} < P X {S{B) + 5(v + B) > 2{[Xc\ - r)} 
dP x 



dQ > 



l { 5( B)+S (v + B)> 2 (LA C j-,) } = 0(e-"-^ B \^ B »\-^). 



Let ||e|| = 1 and n e = {b — (e • b)e : b G B} the projected surface of B on a (d — 1)- 
dimensional hyperplane orthogonal to e. Then (3 := inf|| e || =1 crd_i(n e ) > 0. Hence there 
exists e > such that 

(A.12) a d (B \ (v + B)) > ||v||/3/2 > \\v\\ooP/2 for all HvlU < e. 

By (A.ll) and (A.12), it follows that 

£ Px{S(B) > [Xc\ - r, 5(v + B) > [Xc\ - r} 

^>fc 

Moreover, since a := inf|| v || oo>e Od{B \ (v + B)) > 0, it follows from (A.ll) that 

(A.14) P\{S(B) > [Xc\-r,S{v + B) > [Xc\ -r} = 0(X d - 1/2 e- XI '^ a ). 

ve(A- 1 Z) d :e<||v|| 00 <L 

Lemma 3 then follows from combining (A. 13) and (A.14). □ 

Proof of (A.l). Let e > 0. By Lemma A.l and stationarity, we can select r large 
enough such that 

(A.15) 7ujA := P x {S(u + B)< [Xc\ - r, sup 5(v + B) > [Xc\ } < eX'^e'^ 1 

u-;v^u+l 

for all large A. Let k = [m l / 2 \ , T m = {t G Z d : kl -< t -< (m - fc)l} and ft m = {t G Z d : -< 
t -< ml}\r m . Then 

£ p 4 U (^n«w AJ )} 

(A. 16) _mA _mA 

< £ ^ U ' A+ E ^A(G u , m ,A n G w , m , A ) + £ P*( H u,m,\), 

where 

(A.17) G u , miA = P x {S(v + B)> [Xc\ - r for some v G u + A _1 r m }, 
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(A.18) # u ,m,A = P\{S{v + B)> [Xc\ - r for some v G u + A _1 il m }. 

By (A.4) and (A.15), 

(A.19) Yl 7u,x <[e + o{l)]\ d - l ' 2 e- XI a d {D). 

Let L > £>uPx,ygD ||x — y||oo- Then by (A. 17), stationarity and Lemma A. 2, there exists ttl 
large enough such that for all u 6 Q_ m \-i and large A, 

u,m,\ H G W]mi x) 

w6C mA _i:w^u 

<m d Y P\{S(B) > [Ac] - r, 5(v + B) > [Xc\ - r} < [e + o{l)\m d \- l / 2 e- xl . 

ve(\- 1 Z) d :k\- 1 <\\v\\ oc <L 

Hence by (A.4), 

(A.20) Yl p x(Gu,m,x n G w , ro , A ) < a d (D)[e + o^lA^/V^. 

u,w6C mA _i:w^u 

Since k = L^ 1/2 J, = (m + l) d - (m + 1 - 2/c) d = O^' 1 / 2 ) and it follows from (A.4) 

and a modification of (3.10) (with Yle=-oo instead of Y^=-oo) that 

(A.21) Y P ^ H u, m ,x) = Oim^iXm^fa^X-^e^ 1 ) < eX^e'" a d (D) 

for all large m. We then obtain (A.l) from (A. 16) and (A.19)-(A.21) by choosing e arbitrarily 
small. □ 

Proof of Theorem 1. By the arguments in the beginning of Section 4, it remains to 
show that K is positive and finite. By Lemma A.l, there exists r large enough such that 

(A.22) P\{S{B) < [Xc\ - r, sup 5(v + B) > [Xc\} < X~ 1/2 e - XI 

for all large A. Moreover, by a modification of (3.10) (with Y7e=-oo instead of X^=-oo)> 
(A.23) P\{S(B) > [Xc\ -r} = 0(X- 1/2 e~ XI ). 

By adding up (A.22), (A.23) and applying Lemma 1, we can conclude that K\ < oo. Then 
by (3.12), (A.3) and (A.4), K < K x < oo. 

Next, select e small enough such that 5 := [2ira d (B)M" '{9c)]- 1 / 2 E°=-oo ^ ~ e > °- B y 
Lemma A. 2 with r = and L > sup X;ye£ , ||x — y||oo, there exists k large enough such that 

(A.24) Y P\{S{B) > [Xc\ , 5(v + B) > [Xc\ } < eX~ 1/2 e~ XI 

ve(fcA- 1 Z) d :0<||v|| oo <L 
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for all large A. Then by stationarity, (3.10) and (A. 24), noting that x\ > 0, 
P\{E , m ,x) >P\{S(u + B)> [Xc\ for some u G (£;A _1 Z) d , -< u -< mA _1 l} 
> (Px{S(u + B)>[Xc\} 

ue(k\- 1 Z) d :0-<u-<TnX- 1 l 

P i S ( u + B )> LAcJ , S(w + B) > [Xc\ }) 

we(fcA- 1 Z) d :0<||w-u|| oo <-L 

>(m/k) d 5X^ 2 e- XI for all large A, 

and by letting m —>■ oo with k fixed, it follows from (3.3), (3.11) and x\ bounded that K > 0. 
□ 

References 

Adler, R. (2000). On excursion sets, tube formulas and maxima of random field. Ann. 
Appl. Probab. 10 1-74. 

Albin, J.M.P. (1990). On extremal theory for stationary processes. Ann. Probab. 18 
92-128. 

ALDOUS, D. (1989). Probability Approximations via the Poisson clumping heuristic. Springer- 
Verlag, New York. 

Alm, S.E. (1997). On the distribution of scan statistics of a two-dimensional Poisson process. 
Adv. Appl. Probab. 29 1-18. 

Anderson, N.H. and Titterington, D.M. (1997). Some methods for investigating spatial 
clustering with epidemiological applications. JRSS 'A ' 160 87-105. 

Berman, S.M. (1982). Sojourns and extremes of stationary processes. Ann. Probab. 10 
1-46. 

Bickel, P. and Rosenblatt, M. (1973). Two-dimensional random field. In Multivariate 
Analysis III (P.R. Krishnaiah, ed.) 3-15. Academic, New York. 

Chan, H.P. and Lai, T.L. (2006). Maxima of asymptotically Gaussian random fields and 
moderate deviation approximations to boundary-crossing probabilities of sums of random 
variables with multidimensional indices. Ann. Probab. 34 80-121. 

Chan, H.P. and Zhang, N.R. (2007). Scan statistics with weighted observations. Jour. 
Amer. Statist. Assoc., 102 595-602. 

Cressie, N. (1993). Statistics for Spatial Data. Wiley, New York. 

Feller, W. (1971). An Introduction to Probability Theory and its Applications, Vol 2. 
Wiley, New York. 



17 



Glaz, J., Naus, J. and Wallenstein, S. (2001). Scan Statistics. Springer, New York. 

Hog AN, M. and Siegmund, D.O. (1986). Large deviations for the maxima of some random 
fields. Adv. Appl. Math. 7 2-22. 

Loader, C. (1991). Large-deviation approximation to the distribution of scan statistics. 
Adv. Appl. Probab. 23 751-771. 

Naus, J.I. (1965). Clustering of random points in two dimensions. Biometrika 52 263-267. 

PlCKANDS, J. (1969). Upcrossing probabilities for stationary Gaussian processes. Trans. 
Amer. Math. Soc. 145 51-75. 

Piterbarg, V. (1996). Asymptotic Methods in the Theory of Gaussian Processes and Fields. 
Transl. vol 148, Amer. Math. Soc, Providence. 

Qualls, C. and Watanabe, H. (1973). Asymptotic properties of Gaussian random fields. 
Trans. Amer. Math. Soc. 177 155-171. 

Shafie, K., Sigal, B., Siegmund, D and Worsley, K.J. (2003). Rotation space random 
fields with an application to fMRI data. Ann. Statist. 31 1732-1771. 

Siegmund, D.O. (1985). Sequential Analysis. Springer, New York. 

Siegmund, D.O. and Worsley, K.J. (1995). Testing for a signal with unknown location 
and scale in a stationary Gaussian random field. Ann. Statist. 23 608-639. 

Spivak, M. (1965). Calculus on Manifold: A Modern Approach to Classical Theorem of 
Advanced Calculus. W.A. Benjamin, New York. 

Taylor, J. (2006). A Gaussian kinematic formula. Ann. Probab. 34 122-158. 
Taylor, J., Takemura, A. and Adler, R. (2005). Validity of the expected Euler charac- 
teristic heuristic. Ann. Probab. 33 1362-1396. 



18 



